Probabilistic Linear Tree
Abstract
DNF systems divide the input space with decision surfaces defined by conjunctions of conditions on attribute-value pairs. Each such surface is therefore orthogonal to the axis of the tested attribute and parallel to all the other axes. In this paper, we present Ltree, a system that can define decision surfaces both orthogonal and oblique to the axes defined by the attributes of the input space. It does so by combining a decision tree with a linear discriminant by means of constructive induction. At each decision node, Ltree defines a new instance space by inserting new attributes: the projections of the instances that fall at that node onto the hyperplanes given by a linear discriminant function. This new instance space propagates downwards through the tree, and tests based on the new attributes are oblique in the original input space. We carried out experiments on sixteen benchmark datasets and compared our system with other well-known decision tree systems, both oblique and orthogonal, such as C4.5, OC1, and LMDT. Based on these experiments, we claim that our system has advantages in both accuracy and tree size at statistically significant confidence levels.
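The node-level step described in the abstract can be sketched as follows. This is a minimal illustration rather than the authors' implementation. It assumes a Gaussian linear discriminant with a pooled covariance matrix and a Gini-based split search, neither of which is specified in the abstract, and it handles a single node without the downward propagation through the tree. The function names (`fit_linear_discriminant`, `extend_instances`, `best_univariate_split`) are hypothetical.

```python
import numpy as np

def fit_linear_discriminant(X, y):
    """Return (W, b) so that X @ W.T + b gives one linear score per class.
    Assumption: Gaussian classes sharing one pooled covariance matrix."""
    classes = np.unique(y)
    n, d = X.shape
    means = np.array([X[y == c].mean(axis=0) for c in classes])
    priors = np.array([(y == c).mean() for c in classes])
    pooled = np.zeros((d, d))
    for c, mu in zip(classes, means):
        diff = X[y == c] - mu
        pooled += diff.T @ diff
    pooled /= max(n - len(classes), 1)
    inv = np.linalg.pinv(pooled)  # pseudo-inverse tolerates a singular covariance
    W = means @ inv               # shape (n_classes, n_attributes)
    b = -0.5 * np.sum(W * means, axis=1) + np.log(priors)
    return W, b

def extend_instances(X, W, b):
    """Constructive induction: append one new attribute per class score."""
    return np.hstack([X, X @ W.T + b])

def gini(y):
    """Gini impurity of a label vector (0.0 for an empty or pure set)."""
    if len(y) == 0:
        return 0.0
    _, counts = np.unique(y, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def best_univariate_split(X, y):
    """Search every attribute and midpoint threshold; return (attr, thr, impurity).
    Assumption: weighted Gini impurity stands in for the paper's criterion."""
    best = (None, None, np.inf)
    n = len(y)
    for j in range(X.shape[1]):
        values = np.unique(X[:, j])
        for t in (values[:-1] + values[1:]) / 2.0:
            left = X[:, j] <= t
            score = (left.sum() * gini(y[left]) + (~left).sum() * gini(y[~left])) / n
            if score < best[2]:
                best = (j, t, score)
    return best

# Toy data: integer grid points labelled by the oblique boundary x0 + x1 > 0.
grid = np.arange(-5, 6)
pts = np.array([(i, j) for i in grid for j in grid if i + j != 0], dtype=float)
labels = (pts.sum(axis=1) > 0).astype(int)

W, b = fit_linear_discriminant(pts, labels)
raw_split = best_univariate_split(pts, labels)
ext_split = best_univariate_split(extend_instances(pts, W, b), labels)
print("original attributes:", raw_split)
print("extended attributes:", ext_split)
```

On this toy data no axis-parallel test can separate the classes, whereas a single threshold on one of the appended attributes can. Because each appended attribute is a linear combination of the original ones, a univariate test on it corresponds to an oblique surface in the original input space.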
Similar resources
Extension of Cube Attack with Probabilistic Equations and its Application on Cryptanalysis of KATAN Cipher
Cube Attack is a successful case of Algebraic Attack. Cube Attack consists of two phases, linear equation extraction and solving the extracted equation system. Due to the high complexity of equation extraction phase in finding linear equations, we can extract nonlinear ones that could be approximated to linear equations with high probability. The probabilistic equations could be considered as l...
Studying impressive parameters on the performance of Persian probabilistic context free grammar parser
In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...
ProbVerus: Probabilistic Symbolic Model Checking
Model checking can tell us whether a system is correct; probabilistic model checking can also tell us whether a system is timely and reliable. Moreover, probabilistic model checking allows one to verify properties that may not be true with probability one, but may still hold with an acceptable probability. The challenge in developing a probabilistic model checker able to handle realistic system...
A Trust Based Probabilistic Method for Efficient Correctness Verification in Database Outsourcing
Correctness verification of query results is a significant challenge in database outsourcing. Most of the proposed approaches impose high overhead, which makes them impractical in real scenarios. Probabilistic approaches are proposed in order to reduce the computation overhead pertaining to the verification process. In this paper, we use the notion of trust as the basis of our probabilistic app...
Symbolic Model Checking for Probabilistic Processes
We introduce a symbolic model checking procedure for Probabilistic Computation Tree Logic PCTL over labelled Markov chains as models. Model checking for probabilistic logics typically involves solving linear equation systems in order to ascertain the probability of a given formula holding in a state. Our algorithm is based on the idea of representing the matrices used in the linear equation sys...
Probabilistic analysis of the asymmetric digital search trees
In this paper, by applying three functional operators the previous results on the (Poisson) variance of the external profile in digital search trees will be improved. We study the profile built over $n$ binary strings generated by a memoryless source with unequal probabilities of symbols and use a combinatorial approach for studying the Poissonized variance, since the probability distribution o...